PyDigger - unearthing stuff about Python


Name | Version | Summary | Date
shtec-rlhf | 0.0.3.dev0 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-05-20 15:34:00
trl | 0.8.4 | Train transformer language models with reinforcement learning. | 2024-04-17 15:16:50